AITopics | entrance exam

entrance exam

ChatGPT trounces humans in entrance exams for top Japan university, study finds

The Japan Times | Apr-30-2026, 07:38:00 GMT

AI models surpassed the highest score recorded for a human test taker in this year's University of Tokyo entrance exam, a new study shows. If an artificial intelligence model such as ChatGPT had taken the entrance exams for Japan's top university in 2026, it would have been assessed as top of the class and admitted for scoring higher than any human test takers, a study by AI startup LifePrompt has found. The research used three major AI models -- ChatGPT 5.2 Thinking by OpenAI, Gemini 3 Pro Preview by Google and Claude Opus 4.5 by Anthropic -- and had them take the actual entrance exam used by the University of Tokyo in February 2026 to assess candidates for courses set to start in April. The university's category 3 science exam, often taken by those who want to enter the institution's medical school, is considered the most difficult exam to pass in Japan.

large language model, machine learning, natural language, (13 more...)

The Japan Times

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.89)

Industry:

Media > News (0.71)
Education > Assessment & Standards > Student Performance (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Shaping Explanations: Semantic Reward Modeling with Encoder-Only Transformers for GRPO

Pappone, Francesco, Lazzaroni, Ruggero Marino, Califano, Federico, Gentile, Niccolò, Marras, Roberto

arXiv.org Artificial Intelligence | Sep-17-2025

While Large Language Models (LLMs) excel at generating human-like text, aligning their outputs with complex, qualitative goals like pedagogical soundness remains a significant challenge. Standard reinforcement learning techniques often rely on slow and expensive LLM-as-a-judge evaluations or on brittle, keyword-based metrics like ROUGE, which fail to capture the semantic essence of a high-quality explanation. In this work, we introduce a novel approach to reward shaping within the Group Relative Policy Optimisation (GRPO) framework. Our central contribution is the use of a small, efficient encoder-only transformer as a semantic reward model. This model provides a dense, semantically rich reward signal based on the cosine similarity between a generated explanation and a ground-truth reference, guiding the policy towards explanations that are not just factually correct but also structurally and conceptually aligned with expert reasoning. We apply this method to the task of training a model for the Italian medical-school entrance examinations, following standard domain-adaptive continued pre-training (CPT) and supervised fine-tuning (SFT). Our results demonstrate that GRPO with our proposed semantic reward significantly improves explanation faithfulness and clarity over a strong SFT baseline, showcasing the power of using lightweight encoder models for nuanced reward shaping in complex generation tasks.
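
The reward signal described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy three-dimensional vectors stand in for embeddings that an encoder-only transformer would produce, the function names are hypothetical, and the mean/standard-deviation normalisation is the commonly used group-relative advantage form, assumed here rather than taken from the paper.

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

def group_relative_advantages(rewards, eps=1e-8):
    """Standardise rewards within one group of sampled completions."""
    mean = sum(rewards) / len(rewards)
    variance = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = math.sqrt(variance)
    return [(r - mean) / (std + eps) for r in rewards]

# Hypothetical embeddings: one reference explanation and a group of
# three sampled explanations for the same question.
reference = [1.0, 0.0, 1.0]
group = [
    [1.0, 0.0, 1.0],  # same direction as the reference
    [1.0, 1.0, 0.0],  # partially aligned
    [0.0, 1.0, 0.0],  # orthogonal to the reference
]

rewards = [cosine_similarity(e, reference) for e in group]
advantages = group_relative_advantages(rewards)

for r, a in zip(rewards, advantages):
    print(f"reward={r:.3f} advantage={a:+.3f}")
```

In this sketch the explanation closest to the reference receives the largest advantage within its group, which is the property a policy update would exploit.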

explanation, large language model, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2509.13081

Country: Europe (0.28)

Genre: Research Report > New Finding (0.54)

Industry:

Education (0.68)
Health & Medicine > Therapeutic Area (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

BLUEX Revisited: Enhancing Benchmark Coverage with Automatic Captioning

Santos, João Guilherme Alves, Bonás, Giovana Kerche, Almeida, Thales Sales

arXiv.org Artificial Intelligence | Sep-1-2025

With the growing capabilities of Large Language Models (LLMs), there is an increasing need for robust evaluation methods, especially in multilingual and non-English contexts. We present an updated version of the BLUEX dataset, now including 2024-2025 exams and automatically generated image captions using state-of-the-art models, enhancing its relevance for data contamination studies in LLM pretraining. Captioning strategies increase accessibility to text-only models by more than 40%, producing 1,422 usable questions, more than doubling the number in the original BLUEX. We evaluated commercial and open-source LLMs and their ability to leverage visual context through captions.

caption, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2508.21294

Country: South America > Brazil (0.15)

Genre: Research Report (1.00)

Industry: Education > Educational Setting (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.76)

Evaluating GPT-4's Vision Capabilities on Brazilian University Admission Exams

Pires, Ramon, Almeida, Thales Sales, Abonizio, Hugo, Nogueira, Rodrigo

arXiv.org Artificial Intelligence | Nov-23-2023

Recent advancements in language models have showcased human-comparable performance in academic entrance exams. However, existing studies often overlook questions that require the integration of visual comprehension, thus compromising the full spectrum and complexity inherent in real-world scenarios. To address this gap, we present a comprehensive framework to evaluate language models on entrance exams, which incorporates both textual and visual elements. We evaluate the two most recent editions of Exame Nacional do Ensino Médio (ENEM), the main standardized entrance examination adopted by Brazilian universities. Our study not only reaffirms the capabilities of GPT-4 as the state of the art for handling complex multidisciplinary questions, but also pioneers a realistic assessment of multimodal language models on Portuguese examinations. One of the highlights is that text captions transcribing visual content outperform the direct use of images, suggesting that the vision model has room for improvement. Yet, despite improvements afforded by images or captions, mathematical questions remain a challenge for these state-of-the-art models. The code and data used in the experiments are available at https://github.com/piresramon/gpt-4-enem.

caption, exam, language model, (15 more...)

arXiv.org Artificial Intelligence

2311.14169

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Switzerland (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Education > Educational Setting > Higher Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

ChatGPT is suddenly everywhere. Are we ready?

Engadget | Feb-3-2023, 18:00:31 GMT

For a product that its own creators, in a marketing pique, once declared "too dangerous" to release to the general public, OpenAI's ChatGPT is seemingly everywhere these days. The versatile automated text generation (ATG) system, which is capable of outputting copy that is nearly indistinguishable from a human writer's work, is officially still in beta but has already been utilized in dozens of novel applications, some of which extend far beyond the roles ChatGPT was originally intended for -- like that time it simulated an operational Linux shell or that other time when it passed the entrance exam to Wharton Business School. The hype around ChatGPT is understandably high, with myriad startups looking to license the technology for everything from conversing with historical figures to talking to historical literature, from learning other languages to generating exercise routines and restaurant reviews. But with these technical advancements comes a slew of opportunities for misuse and outright harm. And if our previous hamfisted attempts at handling the spread of deepfake video and audio technologies were any indication, we're dangerously underprepared for the havoc that at-scale, automated disinformation production will wreak upon our society.

chatgpt, nonnecke, openai, (16 more...)

Engadget

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > New York (0.04)
North America > United States > California > Orange County > Irvine (0.04)

Genre: Overview (0.34)

Industry:

Media (1.00)
Information Technology > Security & Privacy (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.52)

On The Reasons Behind Decisions

Darwiche, Adnan, Hirth, Auguste

arXiv.org Artificial Intelligence | Feb-21-2020

Recent work has shown that some common machine learning classifiers can be compiled into Boolean circuits that have the same input-output behavior. We present a theory for unveiling the reasons behind the decisions made by Boolean classifiers and study some of its theoretical and practical implications. We define notions such as sufficient, necessary and complete reasons behind decisions, in addition to classifier and decision bias. We show how these notions can be used to evaluate counterfactual statements such as "a decision will stick even if ... because ... ." We present efficient algorithms for computing these notions, which are based on new advances on tractable Boolean circuits, and illustrate them using a case study.
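
As a rough illustration of the "sufficient reason" notion mentioned in this abstract, the sketch below enumerates, by brute force, the minimal sets of feature values that force a Boolean classifier's decision on a given instance. It is not the paper's algorithm, which relies on tractable Boolean circuits; the admission classifier, feature order, and function names are hypothetical, and the enumeration is exponential in the number of features.

```python
from itertools import combinations, product

def is_sufficient(classifier, instance, kept):
    """True if fixing the features in `kept` to their instance values forces the decision."""
    decision = classifier(instance)
    free = [i for i in range(len(instance)) if i not in kept]
    # Try every assignment of the features that are not fixed.
    for values in product([False, True], repeat=len(free)):
        candidate = list(instance)
        for i, v in zip(free, values):
            candidate[i] = v
        if classifier(tuple(candidate)) != decision:
            return False
    return True

def sufficient_reasons(classifier, instance):
    """Minimal sets of feature indices whose instance values force the decision."""
    n = len(instance)
    found = []
    # Visiting subsets by increasing size means any superset of an
    # already-found reason is non-minimal and can be skipped.
    for size in range(n + 1):
        for kept in combinations(range(n), size):
            if any(set(r) <= set(kept) for r in found):
                continue
            if is_sufficient(classifier, instance, kept):
                found.append(kept)
    return found

# Hypothetical classifier over (passed_exam, high_gpa, has_recommendation).
def admit(x):
    passed_exam, high_gpa, has_recommendation = x
    return passed_exam and (high_gpa or has_recommendation)

reasons_a = sufficient_reasons(admit, (True, True, False))
reasons_b = sufficient_reasons(admit, (True, True, True))
print(reasons_a)
print(reasons_b)
```

For the first applicant, the only minimal reason is passing the exam together with a high GPA; for the second, either the GPA or the recommendation can be paired with the exam result.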

implicant, prime implicant, sufficient reason, (15 more...)

arXiv.org Artificial Intelligence

2002.09284

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Schools tapping smartphone and tablet apps to engage a new generation

The Japan Times | Jan-11-2018, 05:40:12 GMT

Smartphone and tablet computer apps are seeing increasing use in Japanese schools as teachers look to capitalize on what has become many young people's preferred window to the world. Artificial intelligence-assisted apps have become prevalent in education, particularly in subjects many Japanese teachers struggle to teach well. One subject educators need help with is teaching English, a task that will become all the more important when speaking ability enters the joint achievement test in 2020, part of Japan's high-pressure university entrance exams. Nippon Sports Science University Kashiwa High School in Chiba Prefecture uses an app called TerraTalk to help students improve their English conversation skills. The school introduced the app last summer for use by students planning to study abroad.

app, artificial intelligence, student, (9 more...)

The Japan Times

Country:

Asia > Japan > Honshū > Kantō > Chiba Prefecture (0.25)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.07)
Asia > Japan > Honshū > Kantō > Saitama Prefecture > Saitama (0.05)

Industry: Education > Educational Setting > K-12 Education (0.74)

Technology:

Information Technology > Communications > Mobile (0.74)
Information Technology > Artificial Intelligence (0.56)

Robots Behaving Badly

#artificialintelligence | Dec-28-2017, 17:51:47 GMT

Summary: For your holiday reading we present this selection of robot and AI fails. We hope this brings you hope and cheer for the coming year to know that our robot overlords are not as close as some think. Alas, it seems we've got a few more years before the robots take over.

alexa, artificial intelligence, robot, (18 more...)

#artificialintelligence

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.05)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(3 more...)

Industry: Information Technology (0.96)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.55)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.48)

Video Friday: Powered Exoskeleton, Drone Shows, and Soft Robotic Mask

IEEE Spectrum Robotics | Sep-1-2017, 18:01:01 GMT

Video Friday is your weekly selection of awesome robotics videos, collected by your Automaton bloggers. We'll also be posting a weekly calendar of upcoming robotics events for the next two months (send us your events!). Let us know if you have suggestions for next week, and enjoy today's videos. I don't know much about this powered partial exoskeleton called KOMA, except that the company behind it (ATOUN, from Japan) says that it's designed to help you carry very heavy objects in a way that won't interfere with your natural movements. Jiří Zemánek and Martin Gurtner from the Czech Technical University in Prague won first place in the IEEE CSS video contest (awarded at the IEEE CCTA 2017 conference) for their video demonstrating numerical optimal control on a "flying ball in a hoop" system. The IEEE CCTA Conference, incidentally, was held on the Kohala Coast in Hawaii, where as far as I know we have not had a major robotics conference recently.

artificial intelligence, robot, video friday, (13 more...)

IEEE Spectrum Robotics

Country:

North America > United States > Hawaii (0.25)
Europe > Czechia > Prague (0.25)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
(5 more...)

Industry: Government (0.31)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Can a robot pass a university entrance exam?

#artificialintelligence | Aug-31-2017, 05:40:30 GMT

Meet Todai Robot, an AI project that performed in the top 20 percent of students on the entrance exam for the University of Tokyo -- without actually understanding a thing. While it's not matriculating anytime soon, Todai Robot's success raises alarming questions for the future of human education. How can we help kids excel at the things that humans will always do better than AI? Could an AI pass the entrance exam for the University of Tokyo? Noriko Arai oversees a project that wants to find out.

artificial intelligence, entrance exam, university entrance exam, (4 more...)

#artificialintelligence

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.91)

Industry: Education (1.00)

Technology: Information Technology > Artificial Intelligence > Robots (0.98)
